Improving Cross-Validation Classifier Selection Accuracy through Meta-Learning
نویسنده
چکیده
In order to choose from the large number of classification methods available for use, cross-validation error estimates are often employed. We present this cross-validation selection strategy in the framework of meta-learning and show that conceptually, metalearning techniques could provide better classifier selections than traditional cross-validation selection. Using various simulation studies we illustrate and discuss this possibility. Through a collection of datasets resembling real-world data, we investigate whether these improvements could possibly exist in the real-world as well. Although the approach presented here currently requires significant investment when applied to practical applications, the concept of being able to outperform cross-validation selection opens the door to new classifier selection strategies.
منابع مشابه
Classification Accuracy and Model Selection in k-Nearest Neighbors Classifiers for Data Driven Learning
Pattern classification is a core research area and a main task in pattern recognition. A classifier induced by machine learning algorithms maps an unlabeled instance to a label using internal data structures. In this paper we experiment first by changing the k value of nearest neighbors from 3 to 15 and compare the accuracy of two classifiers on various training and test sets. The results show ...
متن کاملDetection of malicious attacks by Meta classification algorithms
-------------------------------------------------------------------ABSTRACT--------------------------------------------------------------We address the problem of malicious node detection in a network based on the characteristics in the behavior of the network. This issue brings out a challenging set of research papers in the recent contributing a critical component to secure the network. This ...
متن کاملImproving Accuracy in Intrusion Detection Systems Using Classifier Ensemble and Clustering
Recently by developing the technology, the number of network-based servicesis increasing, and sensitive information of users is shared through the Internet.Accordingly, large-scale malicious attacks on computer networks could causesevere disruption to network services so cybersecurity turns to a major concern fornetworks. An intrusion detection system (IDS) could be cons...
متن کاملRanking of Classifiers based on Dataset Characteristics using Active Meta Learning
Classification is a machine learning technique which is used to categorize the different input patterns into different classes. To select the best classifier for a given dataset is one of the critical issues in Classification. Using cross-validation approach, it is possible to apply candidate algorithms on a given dataset and best classifier is selected by considering various evaluation measure...
متن کاملBayesian Model Averaging for Improving Performance of the Naïve Bayes Classifier
Feature selection has proved to be an effective way to reduce the model complexity while giving a relatively desirable accuracy, especially, when data is scarce or the acquisition of some feature is expensive. However, the single selected model may not always generalize well for unseen test data whereas other models may perform better. Bayesian Model Averaging (BMA) is a widely used approach to...
متن کامل